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Abstract. Using various results from extremal set theory (interpreted in the lan- 
guage of additive combinatorics), we prove an asyptotically sharp version of Freiman's 
theorem in F£ : if A C F£ is a set for which \A + A\ ^ K\A\ then A is contained in 
a subspace of size 

2 2K+0(VKlogK)\ A \. except for thc 0(y/K log K) error, this is best 
possible. If in addition we assume that A is a downset, then we can also cover A 
by 0(AT 46 ) translates of a coordinate subspace of size at most \A\, thereby verifying 
the so-called polynomial Freiman-Ruzsa conjecture in this case. A common theme in 
the arguments is the use of compression techniques. These have long been familiar 
in extremal set theory, but have been used only rarely in the additive combinatorics 
literature. 



1. Introduction and statement of results 

If A, B are any two subsets of an abelian group G we write A + B := {a+b : a £ A, b £ 
B} for the collection of pairwise sums of A and B, and \ A\ for the cardinality of A. Define 
the doubling constant a (A) of a non-empty finite set A by a [A] : = ^rgj^ ■ A fundamental 
theorem of Freiman [5J in additive combinatorics asserts that non-empty finite sets of 
integers with small doubling constant have a special structure, namely that they can 
be efficiently contained in a generalized arithmetic progression. Similar results are 
known for sets in any abelian group [5], in which the notion of a generalized arithmetic 
progression is replaced by the slightly more general notion of a coset progression. See 
[T5| Chapter 5] for further discussion. 

The situation becomes particularly simple in the finite field model, in which the 
ambient group G has a fixed finite torsion. In this paper we shall concern ourselves 
exclusively with the two-torsion case 

G = ¥ I 2 := {(xi) ieI : Xi £ F 2 for all % £ /} 

where ¥2 is the finite field with two elements and / C Z + is a finite set of natural 
numbers; we abbreviate^F^ 1 ' "' n ^ as ¥%. If J C I then we embed ¥ 2 into F2 in the obvious 
manner (extending by zeroes), and so in particular ¥ 2 n C F2 whenever m < n. We refer 
to F2 as a coordinate subspace of ¥ ! 2 . The article [7] provides an extended discussion of 
the role of finite field models in additive combinatorics. As we shall soon see, additive 
combinatorics in this setting is also connected to more classical combinatorial problems 
in set systems. 



The first author is a Clay Research Fellow, and is pleased to acknowledge the support of the Clay 
Mathematics Institute. The second author is supported by a grant from the MacArthur Foundation. 

1 Of course, every F^ is isomorphic to for some n, but to avoid some tedious relabeling later on in 
the arguments it is convenient to work in the slightly this slightly more general setting when the label 
set I does not need to be an initial segment. 
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In the finite field model, the analogue of a generalized arithmetic progression or coset 
progression is an affine subspace (i.e. a translate of a linear subspace of F 2 ); note that 
these are the unique sets of doubling constant 1. We thus expect that sets of small 
doubling constant to be related to affine subspaces. To quantify this relationship, let 
us define two quantities. 

Definition 1.1 {F(K) and G{K)). Let K ^ 1. We define F(K) to be the least constant 
such that for any finite I C Z + and any non-empty A C F 2 with doubling constant 
cr[A] ^ K, there exists an affine subspace V C F 2 of cardinality \V\ ^ which 
contains A. 

We define G(K) to be the least constant such that for any finite / C Z + and any 
non-empty A C Fj with doubling constant a [A] ^ K, there exists a linear subspace 
V Q F 2 of cardinality \V\ ^ \A\ such that A is covered by at most G(K) translates of 
V. 

Thus for instance one can easily verify that F(l) = G(l) = 1, and that F and G are 
both non-decreasing in K. 

It is not immediately obvious that F and G are even finite. However, in pJ3] the first 
non-trivial upper bound on F was shown, namely F(K) < K 2 2 K for all K > 1. This 
was improved by Ruzsa (see [3]) to F(K) ^ K2^ K ^ 3 ~ 1 , by the first author and Ruzsa 
[8] to < K 2 2 2R2 ~ 2 and subsequently by Sanders [H] to ^ 2°^ K&/Ho ^ 1+K ^ . 

Some additional results for F(K) are known when K is small: see §|6l However, our 
primary interest here is with the large K case (e.g. K ^ 2). 

A lower bound for F(K) is not hard to obtain, as the following example shows. 

Lemma 1.2 (Independent points example). For all K ^ 1 we /jaue 

F(iT) ^ 2 2A ""° (log(1+x)) . 

Proof. Let r := [2-^ — lj, and consider the set A = {0, ei, . . . , e r } C F£. We have 
\A + A\ = \{r 2 + r + 2) and so <rL4] |(r + 1) ^ K. Furthermore A is not contained in 
any subspace of dimension less than r. Thus F(i^)|A| ^ 2 r , and whence \og 2 F(K) ^ 
r - log 2 \A\ = 2K - 0(log(l + K)) as desired. □ 

The first main theorem in this paper, which we prove in £j5j provides an upper bound 
to match this trivial lower bound. 

Theorem 1.3 (Asymptotic for F(K)). For all K ^ 2 we have 

F(K) = 2 2K+ °( y/WlosK \ 

Except for the issue of improving the bounds on the error term 0(\/K\ogK), this 
settles the question of determining F(K). Now we turn to the quantity G(K). This 
quantity is related to F(K) by the easily verified inequalities 

1 + log 2 F(K) ^ G(K) ^ 2F(K). 

In particular, Theorem 11.31 gives G{K) ^ 2 2K+0 ^ yr ^ losK ^ for K ^ 2. However, one can 
do better than this. The following conjecture was attributed to Marton in [13] and is 
sometimes known as the polynomial Freiman- Ruzsa conjecture (PFR). 

Conjecture 1.4 (Polynomial Freiman-Ruzsa conjecture). For any K ^ 1 we have the 
inequality G(K) K 0( - l \ In other words, any non-empty set A C F 2 is contained in 
the union of 0(K°^) cosets of a subspace of cardinality at most \A\. 
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It follows quickly from [10, Corollary 1.5] and Ruzsa's covering lemma (see e.g. [T5"j 
Lemma 2.14]) that G(K) < K°^\ This is the best upper bound currently known 
for G(K). 

The PFR conjecture allows one to pass from the "combinatorial information" \A + 
A\ ^ K\A\ to the "algebraic information" that A is contained in a union of cosets and 
back again with only polynomial losses in the constants. By contrast the use of results 
such as Theorem 11.31 entail the loss of an exponential, and this is necessary in view of 
Lemma ll.2[ Thus we see that the PFR conjecture would provide significantly more 
precise information on sets of small doubling constant than Theorem 11.31 

We are not able to prove the PFR conjecture in full generality here; however we will 
be able to verify this conjecture in the special case when A is a downset. To discuss 
this model setting we let (ej)j 6 j be the standard basis of F 2 , and let (, ) : ¥ 2 x F2 — > F 2 
be the standard bilinear form for which (ei,ej) = 5ij. If x G ¥ 2 then we will write 
Xi := (x, ti) G F 2 for the ith coordinate of x, thus x = J2iei Xie i- We wm wrr te x _L e { 
if Xi = 0. 

Definition 1.5 (Shift-minimal downsets). Let A C F 2 . We say that A is a downset 
if, whenever % G I, x _L and x + G A, we also have x G A. We say that A is 
shift-minimal if, whenever i < j are elements of J, x _L e^, x _L and x + e j G A we 
have x + Ci a A. If A is both a downset and shift-minimal then we say that it is a 
shift-minimal downset or ^MD for short. 

Remarks. One might usefully think of elements of F 2 as n-dimensional vectors with 
coefficients in {0, 1}. Then A C F 2 is a downset if it is closed under replacing ones by 
zeros, and it is shift-minimal if it is closed under shifting ones to the left. 

The terminology comes from the language of set systems, which motivated much of 
our work in this paper. Given a set X C / we may associate an element xx = Sjgx e j 
of F 2 . This is clearly a bijection: if x G F 2 we write X x = {i G / : x, = 1} for the 
subset of / associated to x. In this way we see that there is a one-to-one correspondence 
between sets A C¥ 2 and set systems on J, that is to say collections of subsets of /. We 
will not make much use of the set system language in this paper, but some readers may 
find it helpful to think in those terms. 

We can specialise Definition 11.11 to downsets or SMDs as follows. 

Definition 1.6 {F SMD (K), F D (K), G S md(K), and G D (K)). Let K > 1. We define 
-Fsmd(-^0 to be the least constant such that for any finite / C Z + and any non-empty 
SMD A C F 2 with doubling constant a [A] ^ K, there exists an affme subspace V C F 2 
of cardinality \V\ ^ Fsmd(-^)|^4| which contains A. 

We define Gsmd{K) to be the least constant such that for any finite / C Z + and 
any non-empty SMD A C F 2 with doubling constant c[A] ^ K, there exists a linear 
subspace V C F 2 of cardinality \V\ ^ \A\ such that A is covered by at most Gsmd(-^) 
translates of V. 

We define F B {K) and G D {K) similarly to F SMD (^) and G S md(^) but replace "SMD" 
with "downset" throughout. 

It is clear that F SM d(^) ^ F B {K) ^ F(K) and that G S md(^) ^ G D (K) <: G(K). 
In fact, in §|2]we shall use compression arguments to show 

Lemma 1.7 (Freiman's theorem is reducible to SMDs). We have F(K) = Fj)(K) = 
F 8 md(K) for all I. 
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In particular, in order to prove Theorem 11.31 it suffices to work in the SMD model. 
Unfortunately we are not currently able to obtain a similar relationship for G(K), 
Gd(K), and Gsmd(K). Nevertheless, we have the following result. 

Theorem 1.8 (PFR for SMDs). We have K 1Amm < G S mb(K) < K 30 for all K > 1. 

In fact we have the following slightly stronger statement. 

Theorem 1.9 (PFR for SMDs, again). If A is a SMD, then A is contained in a 

union of at most 0(cr[A] 30 ) translates ofF^ 2 • 

It is clear that the upper bound in Theorem 11.81 follows from Theorem 11.91 after 
performing an order-preserving relabeling to reduce to the ambient group Fg to F£ for 
some n. The lower bound comes by considering a Hamming ball whose radius is given 
by the solution to a certain optimization problem: the details of this computation are 
given in §H 

We also have a similar theorem for downsets. 

Theorem 1.10 (PFR for downsets). We have K 1Am01 < G a (K) < K m for allK^l. 

Again, we have a slightly stronger statement. 

Theorem 1.11 (PFR for downsets, again). If A C F2 is a SMD, then A is contained in 
a union of at most 0(<rL4] 46 ) translates of a coordinate subspace of cardinality at most 
\A\. 

We shall prove Theorems 11.91 and 11.111 (and hence Theorems 11.81 and 11.101) in §H It 
will be clear from the proofs that the exponents 30 and 46 can be improved, but we do 
not attempt to optimise them here. 

The idea of using methods of extremal set theory to look at Freiman's theorem orig- 
inated from our study of the work of Bollobas and Leader [2]. Our treatment of the 
basic theory of compression operators in the next section is closely based on their work. 

2. Compressions 

In this section we set out some notation for the machinery of compressions and related 
structures which we will use throughout the paper, and in particular to establish Lemma 
11.71 at the end of this section. Throughout this section I is understood to be a finite set 
of natural numbers. 

Definition 2.1 (Lex order). Suppose that x = J2iei XiCi anc ^ ^ na ^ V = Y^i^iVi e i- Then 
we define the lexicographical ordering (or lex ordering) -< by defining x -< y if and only 
if, for the largest coordinate j such that Xj 7^ yj, we have xj < yj. This is clearly a 
total ordering. An initial segment in Fg is any set of the form A := {x G F2 : x ^ y} 
for some i/GFj. 

The following lemma is well known. 

Lemma 2.2 (Adding initial segments of the lex order). Suppose that A and B are 
initial segments of the lex order on F^. Then so is A + B . 

Proof. This is easily proved by strong induction on \A\ + \B\. In fact if \A\ = r and 
\B\ — s and if ^ r, s < 2 k then the function HS(r, s) := \A + B\ satisfies the relations 

HS(2 fc + r,s) =HS(r, s) + 2 fc 
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and 

HS(2 fc + r,2 fe + s) =2 k+1 . 

(The letters HS stand for Hopf and Stiefel, who studied this function in a context arising 
in differential topology.) Its role in the additive combinatorics of F 2 was observed by 
Yuvinsky [TB]. By induction one may confirm (a result of Plagne [UJ) that 




□ 



We now define the notion of a compression with respect to a set of indices. 

Definition 2.3 (J-fibres). Suppose that A C F 2 and J C /. Then we may factor 
Wt, = F 2 V x F^ in the obvious manner. For each x G F 2 ^ J , define the J -fibre of A at x 
to be the set 

A x :={yE ¥ J 2 :(x,y)E A} C F 2 J . 

Definition 2.4 ( J-compressions). Suppose that J C I and that A C F 2 . Then we 
define the J -compression Cj(A) of A to be the set obtained by replacing each fibre A x 
by A x for all x G F 2 ^ J , the initial segment in the lex order on F 2 which has the same 
cardinality as A x . We say that A is J -compressed if Cj(A) = A. If ^ r ^ |/|, we say 
that A is r -compressed if A is J-compressed for all J C / with |J| ^ r. Similarly, we 
shall refer to a compression operator Cj with |J| ^ r as an r -compression. 

Example. If / := {1, 2, 3} and A := {ei, ei+e2, 63}, then C{i}(v4) = {0, e 2 , 63}, C{ 2 }(v4) = 
C 9 (A) = A, C{i i2 }(A) = {0,ei,e 3 }, and C{ li2j3 }(^) = {0,ei,e 2 }. The set {0,e 2 ,e 3 } is 
1-compressed but not 2-compressed. 

The following lemma is trivial: 

Lemma 2.5 (Trivial properties of compressions). Let J C. I. 

(i) If A C F 2; then \Cj(A)\ = \A\, and in particular Cj{A) C A iff Cj(A) D A iff 
Cj{A) = A. 

(ii) If AC. F 2 , t/ien Cj(Cj(/1)) = Cj(A). In oi/ier words, Cj(A) is J-compressed. 

(iii) IfACBCF^ then Cj{A) C Cj(S). 

When the set J has size at most 2, or is equal to / one can describe compressions 
quite explicitly: 

Examples (2-compressions and the full compression). The compression C$ is trivial: 
C$(A) = A for all A. When J = {i} the effect of the compression C^y is to move x + e,, 
where x J_ e i; to x whenever possible. Suppose that J = {i, j} with i < j and that A is 
already a downset. Then the effect of the compression Cn^x is to "shift" x + ej, where 
x _L ej and x J_ e^, to x + whenever possible. If J = I, then C/(A) is simply the 
initial segment of F 2 with length \A\. 

From the above description of compressions we immediately obtain 

Lemma 2.6 (Compressed sets). Let A C F 2 . 

(i) A is always ^-compressed. 

(ii) A is 1- compressed if and only if it is a downset. 

(iii) A is 2-compressed if and only if it is an SMB. 

(iv) A is \I\- compressed if and only if it is an initial segment o/F 2 . 
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Now, we recall the well-known observations that compressions do not increase the 
size of sumsets. 

Lemma 2.7 (Compressions and sumsets). Suppose that A, B C and that J C I. 
Then Cj{A) + Cj(B) C Cj{A + B). In particular \Cj{A) + Cj{B)\ ^\A + B\. 

Proof. We follow Bollobas and Leader [2J. We proceed by induction on J. If J ^ I then 
we work fibrewise. For any two fibres A x , B y C F2 we have, by the inductive hypothesis, 
that 

A X + By C y4 X + .By. 

Thus for any z G F^" 7 we have 

(cm^ + cmb))^ |J (a. + b,) 

C |J A x + B y 

x+y=z 

= J (A X + By) 

x+y=z 

= (A + B) Z 
= Cj(A + B) z , 

the middle step following from the fact that the initial segments of the lex order are 
totally ordered under inclusion. It remains to deal with the case I = J. By applying 
the inductive hypothesis and Lemma [2.121 we see that it suffices to deal with the case 
in which both A and B are invariant under compressions Cj> with J' C J. 

By an order-preserving relabeling we may take I — {1, ... ,n} for some n. From the 
compression-invariance assumption we easily confirm that each of A and B is either an 
initial segment of the lex order or else is equal to the special set 

S , :=ir 1 \{ ei + ... + e n _ 1 }U{e n }, 
a very slight perturbation of an initial segment of lex. Clearly we have Ci(S) = F%~ . 
This easily implies that Ci(S) +Cj(S) C Ci(S + S), so let us assume that A is an initial 
segment of lex and that B = S, so that our task is to prove that 

A + Fg -1 C d{A + S). 

If AC Fg -1 then the left-hand side is just F^ 1 whilst the right contains d(S) = F^ 1 . 
If FJp 1 C A then the left-hand side is F^. A small amount of thought confirms that 
A + S = ¥2 also, thereby concluding the proof. □ 

As a corollary of Lemma 12.71 and Lemma I2.5( i) we obtain 

Corollary 2.8 (Compression decreases doubling constants). Let J C I and let iCFj 
be non-empty. Then a[Cj(A)} ^ cr[A]. 

Another immediate corollary of Lemma 12.71 and Lemma I2.5( i) is 

Corollary 2.9 (Compressed sets closed under addition). Let J C I. Then the sum of 
two J -compressed sets is still J -compressed. In particular, from Lemma Wlh we see that 
the sum of two downsets is a downset, and the sum of two SMDs is an SMD. 



In fact, for downsets one can express sumsets in terms of lattice operations. 
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Definition 2.10 (Lattice structure). If x, y G F2 then we write 

x V y := max(xj, ^)e, and sc A y := min(a; i , j/^e*. 

(In the set systems language, these operations correspond to set union and set intersec- 
tion respectively.) If If A, B C Fg then we write A\/ B := {x \/ y : x €2 A, y E B}. 

Lemma 2.11 (Lattice representation of downset addition). If A,B are downsets then 
A + B = Ay B. 

Proof. If x G A and y G B, we observe that x + y = x'\/y', where x' G A and y' G B are 
formed from a;, y by zeroing out those coordinates i for which Xj = yi = 1. This shows 
that A + BCivB. The identity x V y = x + y' confirms that AV B C A + B. □ 

The next lemma states that by applying a suitable sequence of compressions to a 
set A one may obtain a set which is inert with respect to the application of further 
compressions. 

Lemma 2.12 (Termination of compression sequences). Let r, 1 ^ r ^ |/|, be an integer 
and let A C F|. TTien t/iere exists an r-compressed set A' which can be obtained from 
A by applying a finite number of r- compressions. 

Proof. By the principle of infinite descent, it suffices to exhibit a weight function 
w : P(F^) — > Zj> with the property that w(Cj(A)) < w(A) whenever Cj is an re- 
compression and Cj(A) 7^ A. It is then clear that one may find a sequence I±, . . . , 1^ 
with the required properties and with k ^ w(A). There is a wide choice of possible 
weight functions. One is 

W (A) :=J2 2P(X \ 

where p(x) is the position of x in the lex order on Fj. □ 
Next, we relate compression to Hamming balls. 

Definition 2.13 (Hamming balls). If x G Fg then we define the Hamming norm of x 
as ||x|| := \{i £ I : Xi = If ^ r ^ |/| then we define the Hamming ball 

H r (I) := {x G F2 : ||x|| ^ r}. 

We abbreviate H r ({l, . . . ,n}) as H r {n). 

Example. We have Hi{n) = {0, e±, . . . , e n }. 

Proposition 2.14 (Compression and Hamming balls). Let ^ r ^ |/|. 

(i) If ^ r ^ |/|, then H r (I) is 2-compressed. 

(ii) If H r (I) C ACFj and Cj is a 2-compression, then H r (I) C Cj{A). 

(iii) If A C H r (I) and Cj is a 2-compression, then Cj(A) C H r (I). 

Proof. One easily checks that H r (I) is an SMD, and part (i) then follows from Lemma 
12.61 Parts (ii) and (iii) then follow from Part (i) and Lemma [2.5( iii). □ 

Remark. Observe that the Hamming balls fail to be 3-compressed in general. Because of 
this fact, we will only work with 2-compressions for the rest of the paper. We have only 
included the higher compression operators because (for example) Lemma 12.71 seems 
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to be most naturally proven in this more general context, and because these higher 
compressions may be of use in future applications. 

Finally, we give some relations between subspaces and the SMD property. 

Lemma 2.15 (Compressions and subspaces). Suppose that A C F 2 is a SMD and that 
H C Fj is an affine subspace of dimension d. Let Id be the initial segment of I of length 
d. 

(i) For any J C I , Cj(H) is an affine subspace of dimension d. 

(ii) H is an SMD if and only if H = F 2 d . 
(hi) IfHCA thenF 1 ^ C A. 

(iv) IfACH then A C F$ d . 

(v) If A = H then A = F^. 

Proof. Part (i) can be proven directly. Alternatively, one may use Corollary 12.81 CAA) 
has doubling constant 1 and is therefore an affine subspace, which by cardinality con- 
siderations must have dimension d. 

The "if" direction of (ii) is trivial. To prove the "only if" direction, observe from rank 
considerations that H must have a full projection onto F 2 for at least one set J C I 
with I J\ = d. In other words, H contains an element with at least d non-zero entries. 
Using the SMD property this implies that H contains ^2 ieId e«, and using the downset 

property we conclude that H contains F 2 d . By cardinality or dimension considerations 
we conclude H = F 2 d as desired. 

Parts (iii) and (iv) follow from parts (i), (ii) and Lemma 12.121 (as well as Lemma 
I2.6( iii)). Part (v) follows immediately from parts (iii) and (iv). □ 

We are now in a position to quickly prove Lemma II .71 

Proof of Lemma [7T7[ The claim Fsmb(K) ^ Fd(K) ^ F(K) is trivial, so it suffices to 
show that F(K) ^ Fsmd(-^)- Suppose that A C F 2 has doubling at most K; we wish to 
enclose A in an affine subspace of cardinality at most Fsmd(-^)|^4|- By translation we 
may make A contain the origin, in which case the affine subspace is a linear subspace. 
The claim is then equivalent to the assertion that the linear space spanned by A has 
dimension at most log 2 (F SM D(-ft')|^4|)- 

Suppose that the claim failed. Then we could find linearly independent elements 
Vi, . . . , v n in A with n > log 2 (i 7 sMD(-^)|^4|); an d such that A is contained in the span 
of Vi, . . . ,v n . By a linear transformation (and a relabeling of I) we may assume that 
I D {1, . . . , n} and Vi = for alii = 1, . . . , n. Note that A is contained in the span 
of ei, . . . , e n , i.e. in F2, and so without loss of generality we may take / = {1, . . . , n}. 
Thus A now contains the Hamming ball Hi(n). 

Applying Lemma [2.121 (and Lemma l2.6( iii)) we may form an SMD A' by applying a 
finite number of 2-compressions to A. By Lemma I2.15( i) we see that A' still contains 
#i(n). By Corollary EH] we have a[A'] ^ a[A] < K. By Definition O we thus 
see that A' is contained in an affine space H of dimension at most \og 2 (F SMD (K)\A\) . 
Applying Lemma I2.15( iv) we see that A' is contained in a space F^ for some m ^ 
log 2 (i 7 sMD(-ft')|^4|) < n. But this contradicts the fact that A' contains Hi(n). □ 
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3. SUMSETS AND HAMMING BALLS 

Throughout this section we fix a finite set / of natural numbers. We now use com- 
pression technology to obtain some relationships between sumsets and Hamming balls 
for arbitrary subsets of F2 (not necessarily downsets or SMDs). These results, though 
somewhat diverse, all confirm that the Hamming ball H r (n) is a very unstructured set 
from the additive point of view. We begin with an application of 1-compressions which 
shows that large subsets of Hamming balls must expand under sumsets. 

Proposition 3.1 (Expansion in Hamming balls). Suppose that A,B(1 H r (I) for some 
r^O. Then \A + B\ ^ 9~ r \A\\B\. 

Proof. We are indebted to Oliver Riordan for pointing out the following simple proof; 
our original proof was significantly more complicated. 

Applying 1-compressions to A and B (using Lemma 12.71 and Lemma 12 . 12f) it suffices 
to consider the case when A and B are 1-compressed, and hence (by Lemma l2.6( ii)) 
downsets. By Lemma [2.111 it now suffices to show that \A V B\ ^ 9 _r |/l||.B|. 

To this end, suppose that x G A\/ B. Since x G H 2r (I), the number of representations 
of x as a V b is no more than 




The result follows immediately. □ 

Next, we use 2-compressions to show that sumsets of large sets must eventually escape 
Hamming balls. 

Proposition 3.2 (Expansion out of Hamming balls). Suppose that A, A', A" C F2 is 

such that 

\A\\A'\\A"\ ^ ( v / 5 + 2)- r 2 3 l / l (3.1) 
for some r ^ 0. Then A + A' + A" <2 H\i\- r (I). In other words, A + A' + A" contains 
a vector with at least \I\ — r ones. 

Proof. By an order-preserving labeling we may assume / = {1, ...,n}. Suppose for 
contradiction that we can find A, A', A" such that 

A + A' + A" C H n _ r {n). (3.2) 

Applying 2-compressions to A, A', A" (using Lemma 12.71 Lemma 12.121 and Lemma 
I2.14( i)) we see that we can reduce to the case when A, A', A" are all SMDs. By Lemma 
12.111 (and Corollary [2]9]), it now suffices to find x G A,x' G A',x" G A" such that 
\\x V x' V x"\\ > n — r. 

We therefore explore conditions under which there are x G A,x' G A',x" G A" for 
which ||x V x' V x"\\ is large. Let us imagine that ||x|| + ||a;'|| + counters have been 
placed on the points of /, the number of counters placed on % being X{ + x\ + x". Let 
us begin at i — n and try to redistribute these counters by moving them to the left in 
such a way that as many elements of / are covered by at least one counter. Thus for 
i — n, n — 1, n — 2, . . . in turn we pick up all of the counters at that place, and then put 
one counter down if we have one available in our hand. Write 5" for that set of points 
which have at least one counter at the end, and set z := ^ igS ej. By construction we 
see that 

z = x V x' V x", 
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where x is obtained from x by removing some ones from the coordinate vector of x 
and shifting others to the left, and similarly for x' and x". Since A, A', A" are SMDs it 
follows that leA.i'eA' and x 77 G A", and hence that z G A + A' + A". 

We define the defect d(x, x', x") to be the number of positions which have no counters 
when our process is finished, that is to say d(x,x',x") := n — \S\ = n — ||z||. 

Our aim is to estimate the number of triples (x, x', x") G (Fg) 3 for which d(x, x', x") ^ 
r. To do this, we first imagine a probabilistic variant of the above "moving counters 
to the left" game which takes place on Z<^ instead of [n]. The number of counters on 
each point 0,-1, —2, ... is chosen by tossing three coins, and placing a counter for each 
coin that comes up heads. All coin tosses are independent. Once more one imagines 
moving the counters to the left so as to cover as many points as possible. For n ^ 
we define the n- defect D n (which is a random variable) to be the number of positions 
among {— n, —(n — 1),...,— 1,0} which are left uncovered by our process of moving 
counters to the left. Define D = sup n>0 D n \ we will see shortly that this quantity is 
almost surely finite. Set q(r) := F(D ^ r), thus for instance q(0) = 1. 

Suppose we play the same game but starting from n = — 1 instead of n = 0, ignoring 
the position entirely. The analogous random variable D' has the same probability 
distribution as D, but obeys the relationship 

D ^ r + 1 if and only if D' ^ r + i 

for any fixed r ^ 0, where z'q is the number of counters at the position 0. Conditioning 
this relationship on the random variable io, we obtain the recurrence relation 

q(r + 1) = |g(r + 3) + |g(r + 2) + |g(r + 1) + |g(r) (3.3) 

for all r ^ 0. 

By conditioning upon the position of the r th uncovered counter after the completion 
of our process of moving counters to the left, one may also confirm the multiplicativity 
property 

q(r + s) = q(r)q(s) 

for all r, s ^ 1 (note that the claim is trivial if r = or s = 0). Thus q(r) = <?(l) r . The 
characteristic polynomial for the linear recurrence (13. 3p . x 3 + 3x 2 — 5x + 1, factors as 
(x — l)(x 2 + Ax — 1) and so (since one clearly has q(l) < 1) it follows easily that 

q(r) = {2 + V5)- r . (3.4) 

Now if chosen independently at random from (F2) 3 then 

F(d(x,x',x") ^ r) «C q(r) = (2 + V5)~ r . 

The result follows. □ 

Remarks. Our treatment is essentially taken from [3]. In that article upper bounds 
are obtained for the maximum size of 3-wise t-intersecting set systems: set systems 
T y J 7 ', T" such that \F n F' n F"\ ^ t for all F G F, F' G F', F" G F". Equivalently 
we have \F C U F' c U F" c \ ^ n — t. This condition is very similar to (13.21) . and is in fact 
equivalent to it in the case that F, F' and F" are downsets. 

An equivalent formulation of Proposition I3~2l is that if we have the largeness condition 
(pTTj) . then A + A' + A" + H r (I) = F^. We can now "bootstrap" this proposition to the 
case in which A, A', A" do not have to obey (13. ip . 
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\A + A' + A" + H r (n)\ 




A\ 1 / 3 \A'\ 


i/a\A»\ 


1/3 



Proposition 3.3 (Expansion result). Suppose that A, A', A" C Fg. T/ien 

| A + A' + A" + H r (I)\ ^ min (§(2 + V^IAl 1 ^'! 173 ^'? 73 , 2^ 
Proof. By relabeling we may take I — {1, ... ,n}. Write 

f(n) := min 

v ' A,A',A"C 

|A|| J 4'|| J 4"|^8(2+ v / 5)- r 2 3 ™ 

Claim 3.4. For all n we have 

f{n) > i(2 + v / 5) r '/ 3 . (3.5) 

Proof. We induct on n. Applying 2-compressions together with Lemma 12.12} Lemma 
12.71 and Proposition I2.14H ). we see that the minimum is attained when A, A', A" are 
2-compressed, and thus (by Lemma I2.6( iii)) SMDs. It may be that the minimum is 
attained for sets A, A', A" with |A| \A'\ \A"\ ^ (2 + V5)- r 2 3n . Otherwise (of course) the 
minimum is attained for sets with ^ (2 + \/5)~ r 2 3n . We call these cases 1 

and 2 respectively (these cases can overlap if the minimum is achieved multiple times). 

Suppose that we are in case 1. Split F™ as FIT 1 x F 2 in some arbitrary direction 
and write Aj := {x G F!?" 1 : (x,i) G A} for i = 0, 1. Define A'^A" similarly. Set 
p := |A |/|yi|,y := |t4' |/|v4| and p" := |Aq|/|A|. The downset property implies that 
p,p',p" ^ 1/2; assume without loss of generality that p" ^ p' ^ p ^ 1/2. 

Observe that A + A' + A" + H r (n) contains disjoint copies of Aq + A' Q + Aq + H r (n — 1) 
and A x + A' + Aq + H r (n - 1). Note also that 

(py')l/3 ( p l/3 + (1 _ p )l/3) > p + p 2/3(l _ p )l/3 ^ 1. 

Since we have 

|A||4,|L4£,| s= 8(2 + v^) - ^ 3 ^-^ 

for any choice of i, i', i" G {0, 1}, we see that f(n) ^ f(n — 1) in this case, and the claim 
follows by induction hypothesis. 

In case 2 we are in the situation covered by Proposition 13.21 It follows from that 
proposition that A + A' + A" contains a vector with at least n — r ones. It follows that 
A + A' + A" + H r {n) contains the vector ei + - ■ - + e n and hence, since A + A' + A" + H r {n) 
is an SMD, that 

A + A' + A" + H r {n) =F£. 

Thus 

on 

A") > | A |l/3| A ,|l/3| A1 l/3 > K 2 + V / 5) r/3 > 

as claimed in ( 13. 51) . □ 

We return now to the proof of Proposition [33 If ^ 8(2 + V5)- r 2 3n , then 

we have 

\A + A' + A" + H r (n)\ ^ |(2 + Vhy'^A^A^Af' 3 . 

If instead we have |Ap'p"| < 8(2+ v / 5) _r 2 3n we may apply Proposition ^. 2l to conclude 
that 

A + A' + A" + H r (n) = F™. 
In either case we conclude Proposition 13.31 □ 
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4. The polynomial Freiman-Ruzsa conjecture for SMD sets and 

DOWNSETS 

We use the above machinery, together with the standard tools of Pliinnecke-Ruzsa 
theory (see e.g. [T5l Chapter 2]) to prove Theorem 11.81 or equivalently Theorem 11.91 
We first prove a slight variant of this theorem, which will be useful for establishing all 
of our main theorems. 

Proposition 4.1 (PFR for SMD sets, intersection version). Suppose that A C W™ is a 
SMD and that a[A] < K. Set m := [log 2 \A\\ . Then 

\A<1F?\ > R- 29 \A\. 

Proof. Set ml := m + 51og 2 A + 1. By enlarging n if necessary we may assume that 
n = m' + t for some t > 0. Factor F™' +i = F™' x ^ m ' +1 '"" m ' + * } and consider, for 
each x £ F™', the fibre A x of A above x. Write B = in F™' for the "base", and let 
Ti : F™ +t — > F™' be the projection map. 

Suppose that A contains a vector with Hamming norm r. Since A is a SMD, we 
conclude that H r (m') C 5, and therefore by Proposition 13.31 we have 

|5 + 5 + 5 + 5| ^ + B + B + H r (m')\ ^ min (§(2 + y^) r/3 \B\, 2 m ') . (4.1) 

On the other hand, since A is a downset, we have C Aq for all x £ B, and so 

|A| < \Aq\\B\. 

Also, observe that B + B + B + B + A CA + A + A + A + A, and so 

|£ + B + B + B\\A \ ^\A + A + A + A + A\. 

On the other hand, from the Pliinnecke-Ruzsa inequality (see [T5l Corollary 6.29]) and 
the hypothesis \A + A\ if| A| we have 

+ A + A + A + K 5 \A\ 

From the preceding three inequalities^] we conclude 

\B + B + B + B\ ^ A 5 |.B|. 

The choice of m' guarantees that A 5 1 -B | ^ A 5 1 A | < 2 m ' , and so we are forced to conclude 
from flUD that 

i(2 + v / 5) r/3 ^ A 5 

and hence that 

r^l51og 2+V 5^ + 2. 

In other words, Aq is contained in if r (F 2 m+1 '"'' m+ ^), where r := [151og 2+v ^A" + 2]. 
Applying Proposition 13. II we have 

\{A + A) x \ ^ \A + A X \ > 9- r \A Q \\A x \, 

and therefore, summing over x, we see that 

\A + A\ ^ 9~ r |A ||A|. 

2 The observant reader will see that this argument can in fact be easily modified to the case when A 
is not a downset, simply by replacing Aq with A x where x maximises \A X \. 
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It follows immediately that 

\A Q \ <: 9 r K < K 1+15log 2+^ 9 ^ K 2i . 
Now A is a downset, and so A x C for every x. It follows immediately that 

\B\ > K~ 2A \A\. 

Recall that 5:=AnF™. The fact that A is a downset easily implies that | A n F2 1 / 2 J 
is a non-increasing function of j G N. Recalling the definition of m' in terms of m it 
follows that 

L4nF™| > K~ 29 \A\, 

as claimed. This proves Proposition I4.1[ and hence the upper bound of Theorem 11.81 □ 

Proof of TheoremU~R Set K := a[A], m := [\og 2 \ A\\ , and B := Af] F™. Applying 
Ruzsa's covering lemma (see e.g. [151 Lemma 2.14]) we may cover A by at most 
translates of B - B. But |A + S| «C \A + A\ < and B — B C while from 

Proposition 14. II we have |-B| 3> fC _29 |y4|. The claim follows. □ 

Now we turn from SMD sets to downsets. The key proposition is 

Proposition 4.2 (Small doubling implies escape from Hamming ball). Let A C be 

such that with o~[A] ^ K , and such that A + A + A C H r (n). Then r ^ log 2 \A\ — 
421og a tf-0(l). 

Proof. By applying 2-compressions (using Lemma [2.71 Corollary 12.81 Lemma [2.121 and 
Proposition 12. 14ft we may reduce to the case where A is 2-compressed, and hence (by 
Lemma 12. 6( iii)) an SMD set. 

Let m := [log 2 \A\\, and set B := A flFf. Applying Proposition 14. II we see that 

\B\ > K~ 29 \A\ > K- 29 2 m . 

Applying Proposition 13.21 we see that B + B + B contains a vector of norm at least 

m - 3 log^ +2 K 29 - 0(1) > log 2 \A\ - 42 log 2 K - 0(1). 

But B + B + B is contained in A + A + A, which lies in H r (n) by hypothesis. The claim 
follows. □ 

Proof of Theorem \1.11[ Let K := o~[A]. Let A be a downset with cr[A] ^ -ft". By 
Proposition 14.21 A + A + A contains a vector with Hamming norm at least log 2 \A\ — 
42 log 2 K - 0(1). By Corollary EH A + A + A is a downset, and so A + A + A contains 
a coordinate subspace F 2 of cardinality ^> _ft" _42 |A|. Now observe that 

46mrJi 



\A + ¥ J 2 \ < \A + A + A + A\ < AT 4 |A| < AT^F. 



2 



by the Pliinnecke-Ruzsa inequality (see [I5J Corollary 6.29]). Thus A can be covered by 
0(K 46 ) translates of F 2 . If |F 2 | ^ \A\ we are done, so suppose instead that |F 2 | > \A\. 
Then we have 

|F^| ^ \A + ¥i\ *C \A + A + A + A\ s= K A \A\ *C K A \¥ J 2 \ 

so that A can be covered by 0(K A ) translates of F 2 , which can in turn be covered by 
0(K A ) translates of a vector space of cardinality at most \A\. Thus in both cases we 
obtain Theorem 11.111 and the upper bound of Theorem 11.101 follows. □ 
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It remains to establish the lower bound in Theorems 11.81 and II. 101 Since Gsmd{K) ^ 
Gd(K) ^ G(K), it is clear that the lower bound in Theorem 11.81 is stronger. 

Proof that Gsmb(K) > K 1Am01 . Let a G (0,1/10] be a constant to be optimised 
later. Let A = H^ an ^(n) C F 2 ; this is of course a SMD. A standard computation using 
Stirling's formula shows that 

1^1 _ 2 (A(«)+o(l))n ) 

where h is the entropy function h(a) := —a log 2 a — (1 — a) log 2 (l — a). Since A + A = 
H 2 [an\(n) we have 

-log 2 0-L4] =h(2a) -h(a) + o(l). (4.2) 
n 

Set m := |_l°g2 I^IJ- Since A is a SMD, we see from Lemma [2.151 that if A is covered 
by t translates of a subspace of dimension m then it is in fact covered by t translates of 
F™. Thus if t[A] is the minimum possible value of t then we see that 

r[A] = \H an (n - m)\ 

and hence that 

1 ru 

- log 2 r[A] = (1 - M«))M I37 ^) + o(l). (4.3) 

Using Mathematica we found that the ratio between (14.31) and (14.21) is largest when 
a = 0.0939288 . . ., in which case it is 1.46601 . . . + o(l). □ 



Remarks. Proposition 13.21 immediately implies that if A C F 2 is a downset and if 
1^4 1 = ct2 n then A + A + A contains a subspace of dimension at least n — 0(log(l/a)). 
Proposition 14.21 implies a similar conclusion under the assumption that A is a downset 
with o~[A] ^ K: now A + A + A contains a subspace of dimension at least log 2 \A\ — 
0(}ogK). Results of a similar type, but with weaker dependencies, are known without 
the assumption that A is a downset. If now A is an arbitrary set and \A\ = a2 n then 
A + A + A contains an affine subspace of dimension n — 0(a c ), whilst if a [A] ^ K 
then A + A + A contains an affine subspace of dimension log 2 \A\ — 0(K C ). The first of 
these results is essentially due to Bogolyubov pQ (see also [151 Proposition 4.39]), whilst 
the second follows from the first together with Ruzsa's observation [12] that a set with 
doubling a[A] ^ K is Freiman 6-isomorphic to a subset A 1 C F™, where \A'\ ^ K~ 12 2 m 
(cf. [151 Lemma 5.26], as well as [7j). 

It might reasonably be conjectured that the bounds that we have obtained in the 
downset case hold in general. Thus it is possible that if A C F 2 has size \A\ = a2 n 
then A + A + A contains an affine subspace of dimension at least n — 0(log(l/a)). 
This conjecture, which we call the Polynomial Bogolyubov Conjecture, implies the PFR 
Conjecture, but seems to be strictly stronger than it. Our results verify this conjecture 
in the downset model. 

5. FREIMAN-TYPE THEOREMS IN F 2 

We now turn our attention to the proof of Theorem 11.31 We first need a preliminary 
result: 
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Proposition 5.1 (Freiman's lemma in finite fields). Suppose that A, B C ¥ 2 satisfy 
Hi(n) CBCA. Suppose also that \A\ ^ n c for some C ^ 1. TTien 

+ ^ (|n-0(Vnlogn)) 

where the implied constant depends only on C . 



Remark. Similar results, with the 0(y/n logn) replaced by something weaker such as 
o(n), are available under much weaker hypotheses, for example that \A\ ^ 2 n for 
some 5 > 0. This proposition should be compared with Freiman's lemma [6j, which 
asserts that if A is a subset of M d which is not contained in any proper affine subspace 
then 

\A + A\ ^ {d + l)\A\-\d{d+l). 

Proof. We may assume that n is large, since the claim is vacuously true otherwise. 
Suppose, as a hypothesis for contradiction, that the result is false. By applying 2- 
compressions (using Lemma 12.121 Lemma l2.5( iii). Lemma 12.71 and Proposition 12 .141 ) 
we may assume that A and B are SMDs. The fact that A is a downset, together with 
the upper bound on \A\, implies that every element of A has Hamming norm at most 
C\og 2 n, or in other words A C H C \ g 2 n( n )- Set M := [10Cs/nlog 2 n\. 

Claim 5.2. If x G A there cannot be more than one value of i ^ M such that Xi = 1. 

Proof. Suppose that there is some x G A such that there are two values of i ^ M such 
that Xi = 1. By shifting to the left, we see that H 2 (M) C A. We bound from below the 
number of distinct elements z = a + b with a G H 2 (M), b G B and a A b = 0. 
On the one hand the number of choices for the pair (a, b) is at least 

|B| ^M-Clog 2 ^ >(8 + o(i))c2|B|nlog , n 

On the other hand, no z arises in too many ways as such a sum a + b. Indeed since 

^ 2Clog 2 n, the number of choices for a is no more than (2Clog 2 n) 2 . 
Comparing these two bounds it follows that 

\A + B\ ^ (8 + o(l))C 2 (2C\og 2 n)- 2 \B\n\og 2 2 n > (1 - o(l))n\B\. 

This is contrary to assumption (for n large enough), and so the claim follows. □ 

For each i, M < % ^ n, write Bi := {x G B : x { = 1}. Set B := B \ (J< -Bi- N ° w 
A + B contains the sets 

e* + B : M < % < n 

and 

e { + Bj : M ^i,j ^n,i ^ j. 
No element lies in more than two of these sets, and therefore 

\A + B\ > \ ({n-M)\B \ + (n-M-l)J2\ B i\ J > n ~^~ - \B\. 

\ j=M J 

This concludes the proof of Proposition 15.11 □ 

Proof of Theorem 11.51 In view of Lemma 11.21 and Lemma 11.71 it suffices to show that 
^smd(^) ^ 2 2K +°^^ K \ 
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Let A be an SMD, and let n be the largest integer such that Hi(n) C A. Since A is 
a downset, we see that A C F^. It will thus suffice to show that n ^ log 2 \A\ + 2K + 
O(VKlogK). 

Let m := |_l°g 2 1^1 J an d factor F 2 = F™ x F 2 ' where n' := n — m. For each x G F™ 
write A x for the fibre 

^ := {ye¥^' :(x,y)eA}. 

Let -B be the "base" : 

B := AflF™ = {x : A x ^ 0}. 

Since A is a downset, we have A x C Ao for all x. Now A + A contains B + Aq, and 
therefore 

|B||A | = \B + Aq\ < XL4|. 
On the other hand from Proposition 14.11 we have 

|£| > K~ 29 \A\. 

Comparing these two bounds we see that \A \ <C f^ 30 . If n' ^ K then we are done, and 
if not the hypotheses of Lemma 15.11 are satisfied and so 

\A X + Aq\ Oiy/n'logn')^ \A X \ 

for all x. It follows immediately that 

K\A\ > \A + A\ 

X 

^ ' x 

> (y-O(V^logn')) \A\, 

and therefore 

n' 2 J Fr + 0(v / ^logiT). 
This concludes the proof. □ 



6. The small K case 

Most of our paper has been concerned with the behaviour of quantities such as F(K) 
and G(K) in the asymptotic limit K — > oo. For sake of completeness we make some 
remarks about the case of small K, which involves rather different techniques to those 
employed above. 

In [3] it was shown that F(K) ^ zk K r^~i ^ or ^ ^ ^ ^ T> t°g e fh er with a more 
complicated upper bound for F(K) in the region ^ < K < A. Here we give a precise 
formula for F(K) in the region 1 ^ K < |. 

Proposition 6.1 (F(K) for small K). If K < \, then F(K) = K, while F(K) = ffsT 
for ~ ^ K < |. (In particular, F is discontinuous.) 
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Proof. The lower bound F(K) ^ if (for any if) is clear by setting A to be a random 
subset of a large affine space V of density arbitrarily close to 1/if. The lower bound 
F{K) > f if for if > ~ follows by setting A to be a random subset of {0, ej, e 2 , e 3 } x F^ 3 
with n large of density arbitrarily close to 7^, where ei, . . . , e n is the standard basis of 

Now we show F(K) ^ if for if < ~, using an argument from (T5J Corollary 5.6]. 
From Kneser's theorem (see e.g. [TBI, Theorem 5.5]) we have | A + A\ ^ 2\A\ — |ii|, 
where if is the largest subspace of Fg with the property that A + A is a union of cosets 
of if. Since \A + A| < j\A\, we conclude that |A + A\ < 7\H\, thus A + A is the union 
of at most six cosets of H. In particular, if B := (A + H)/H C W%/H, then B + B 
has cardinality at most 6, and B + B cannot be expressed as the union of cosets of any 
non-trivial subspace of ¥%/H. Kneser's theorem again gives \B + B\ > 2\B\ — 1 and 
thus \B\ ^ 3. If \B\ = 2 or |S| = 3 then B + B is a subspace of dimension 1 or 2 
respectively, which is not permitted, so B is a singleton. Thus A is contained in a coset 
V of H. Since |V| = \A + A\ < if we obtain F(if) ^ if as desired. 

Now take | ^ if < |. Repeating the above argments with the same notation we 
obtain that \B + B\ < 9 and hence \B\ ^ 4. The only new case is when \B\ = 4, in 
which case B + B is contained in a subspace of dimension 3 and has cardinality exactly 
7. This implies that A is contained in an affine subspace of cardinality 8|if|. Since 
7\H\ = \A + A\ < K\A\, the bound F(if) ^ fif follows. □ 

Remark. A similar analysis can (in principle) be used to compute F(K) exactly for any 
given if < 2, though the number of cases to consider via this method grows quickly to 
infinity as if — > 2. 

The same analysis also gives precise values for G(K) when G is small. 

Proposition 6.2 (G(K) for small if). If 1 < if < \, then G S md(^) = G D (K) = 
G(K) = 2, while G SMD (K) = G D (K) = G(K) = 3 for \ < if < §. 

Proof. The lower bound Gsmd(K) ^ 2 for 1 < if < | can be established by setting 
A to be an initial segment in a large space F?? of density arbitrarily close to 1/if . The 
lower bound CrsMD(if) ^ 3 for | ^ if < | can be established by setting A to be the 
Cartesian product (in Fg x F 3 , = F^ 3 ) of an initial segment in a large space Fg of 
density arbitrarily close to j^, together with the set {0, e n +i, e n+ 2, e n+ 3}. 

Since Gsmd(K) ^ G D (if) ^ G(K), it thus suffices to establish the bounds G(K) ^ 2 
for if < | and G(K) ^ 3 for if < |. If if < |, then by the analysis used to 
prove Lemma [6.11 we see that A is contained in an affine space V of size \V\ ^ 
Subdividing V into two affine spaces of size at most K\A\/2 ^ \A\ we obtain the bound 
G(K) 2. (The case when \V\ = 1 can of course be dealt with by hand.) 

If if < |, the only new case is when A is the union of four cosets of H, where 
7\H\ ^ if \A\. In particular, we see that 2\H\ ^ || \A\ < \A\. Covering these four cosets 
of H by three cosets of a subspace of one higher dimension than H we obtain the bound 
G(K) ^3. □ 

Remark. While these above small examples show that Gsmd{K) = G-d(K) = G(K) for 
small if, we do not expect this relationship to hold exactly in general; it may be that 
there are non-downsets of reasonably small doubling constant which are significantly 
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harder to cover efficiently by subspaces than downsets or SMDs with the same doubling 
constant. Certainly the arguments used to prove Lemma [1 . 71 do not seem to extend to 
the G() family of quantities. On the other hand, Conjecture 11.41 would clearly imply 
that Gsmd(K), Gt>(K), and G(K) are equivalent up to polynomial factors. 
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